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A Computer System And a Method for 
Segmentation of a Digital Image 



Field of the invention 

[0001] The present invention relates to the field of 

segmentation of digital images, in particular medical digital 
images, and more particularly to the field of visualization of 
segments corresponding to anatomical structures and to the 
measurement of the volumes of such anatomical structures. 

Background and prior art 

[0002] Segmentation is a crucial task in visualization 

and quantification based on digital medical image data. The 
number of digitally stored medical images and the amount of 
information stored in each image both are steadily increasing, 
thus increasing the importance of robust and fast segmentation 
methods at the same time. Robustness refers to the ability of 
the method to: (i) work in presence of image disturbance, (ii) 
with a low failure rate, and (iii) to work for a wide range of 
different images and patients. 

[0003] Still many specific segmentation problems persist 

that have not been solved satisfactorily. Each problem is 
defined by: (i) the imaging modality, (ii) the imaged organ, and 
(iii) the task to be performed on the images. Examples for such 
segmentation problems are removal of non-brain tissue in three- 
dimensional (3D) magnetic resonance (MR) images of the human 
head, or segmentation of left and right ventricle in cardiac MR 
cine images on all slices and time points. 



[0004] The prior art shows a variety of segmentation 

methods : 



Manual Segmentation 

[0005] Manual tracing for object segmentation is usually 

performed on all two-dimensional slices of a three-dimensional 
image dataset that contain the structure of interest. Manual 
interaction is preferably applied on an intermediate level of 
detail, where local and global image features have to be taken 
into account. In these cases , manual interaction often yields 
more accurate results than fully automated methods. 

[0006] For example, when tracing the brain contours on 

one 2D slice that is part of series of MR images, a trained 
operator combines detailed knowledge about the individual 
anatomy (such as brain, cerebellum, peripheral nerves and 
neighboring blood vessels) . 

[0007] Segmentation problems of this kind are suited for 

manual processing. Even within the framework of semi-automated 
methods for the, e.g., histogram based quantification of 
different tissue types, manual tracing is the preferred method 
for the selection of an appropriate region of interest (ROI) . 

[0008] U.S. Pat. No. 5, 262, 945 shows such a method for 

quantification of brain volume from magnetic resonance images. 

[0009] There are several problems with manual 

segmentation preventing it from being routinely used in many 
cases: (i) Interaction and time costs (manual tracing takes 
several minutes per image), (ii) requirement for sufficient 
training (the segmentation result is highly dependent on the 
level of training) , (iii) requirement for care and concentration 
during segmentation (quality and precision of human interaction 
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is not constant) , and (iv) reduction of objectivity (different 
operators will generate different results) . 

[0010] The growing number of digitally stored medical 

images as well as the advances in scanner resolution, combined 
with high personnel costs are another reason for manual 
segmentation to become even less applicable in the future. At 
the same time, the need for efficient and reliable automated or 
semi-automated segmentation methods is growing. 

Thresholding and Morphology 

[0011] The second group of segmentation methods is 

thresholding in combination with morphology, as described in 
K.H. Hoehne and W.A. Hanson, "Interactive 3D segmentation of MRI 
and CT volumes using morphological operations", J Computer 
Assisted Tomography 16(2): 185-294, 1992. 

[0012] As a first step, the structure of interest is 

roughly separated from the background by simple gray scale 
thresholding, resulting in a binary image. In the brain 
segmentation problem on Tl-weighted MR images, very bright (e.g. 
fat) and very dark (e.g. bone, air) parts of the image are 
suppressed using appropriate thresholds. In some cases, the 
brain is represented by the largest remaining connected 
structure. This structure in the binary image can be applied as 
a mask to the original data. 

[0013] In most cases, however, the structure representing 

the brain in the binary image is connected to neighboring 
structures (caused by nerves, blood vessels, image non- 
uniformity, or limitations of image resolution) . In order to 
separate the brain from other structures, morphological filter 
operations can be used; morphological opening suppresses thin 
structures whilst preserving thicker ones. L. Lemieux, G. 
Hagemann, K. Krakow, and F.G. Woermann, "Fast, accurate, and 
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reproducible automatic segmentation of the brain in Tl-weighted 
volume MRI data", Magnetic Resonance in Medicine 42(1): 127-35, 
Jul 1999, describe a series of thresholding and morphological 
operations that aims to solve the shortcomings of this simple 
approach . 

[0014] Another application of thresholding is described 

in U.S. Pat. No. 5,903,664. For fast segmentation of cardiac 
images, a ROI and a seed point within the ROI are manually 
selected. An operator-defined threshold is used to acquire a 
first segmentation of, e.g., the left cardiac ventricle on that 
slice. This segmentation is defined as a set of connected pixels 
showing gray values above the threshold and including the seed 
point. From that first segmentation, a histogram is computed. A 
further threshold is selected which separates modes of the 
histogram and is used to carry out a revised, final segmentation 
of the current image. The centroid of the current segmentation 
is used as a seed point in segmenting adjacent images. 

[0015] Within the framework of thresholding and 

morphological operations, the user normally interactively 
specifies thresholds, filter sizes, or the number of iterations, 
in order to produce a correct segmentation. A fully automated 
system of thresholding and morphology tends to fail in many 
cases, e.g., when the image acquisition parameters vary. 
Furthermore, the results often need final manual editing. 

[0016] The results presented by Lemieux et al . (1999) 

show a progress in terms of automated thresholding and 
morphological operations. However, this method is tuned to a 
narrow range of image types. In particular, the result is 
dependent on the overall homogeneity of the image. The algorithm 
will fail in presence of moderate contrast non-uniformity, since 
no threshold can be found that is appropriate for all parts of 
the image . 
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[0017] In general, this weakness against image 

inhomogeneities is characteristic for all segmentation methods 
comprising global thresholding. 

Statistical Methods 

[0018] U.S. Pat. No. 5,425,368 describes a method for 

estimating tissue volumes in MR images. More precisely, a fuzzy 
logic approach to the problem of distinguishing cerebrospinal 
fluid, gray and white matter pixels is presented. An 
unsupervised fuzzy clustering procedure based on a variant of 
the fuzzy c-means algorithm computes the percentage of each of 
these three compartments in each image automatically, with 
virtually no operator intervention. Each volume element 
represented in the image can belong to all compartments in 
varying degrees. The procedure requires input of the number of 
different compartments in the image, as well as a parameter 
which determines the amount of overlap of compartment 
boundaries. The method has been applied to the diagnosis of 
hydrocephalus . 

[0019] Statistical segmentation methods are useful, but 

target another class of segmentation problems. In general, 
statistical methods only converge to an appropriate solution 
after the original image has been segmented, so that a small set 
of known tissue types remains plus additional noise and non- 
uniformity. A statistical method could be, for example, a 
subsequent step to brain segmentation, also referred to as skull 
stripping, i.e. the removal of non-brain tissue in 
neuroanatomical images. Conversely, statistical methods such as 
described in U.S. Pat. No. 5,425,368 are not applicable to the 
original MR images directly. 

Deformable Template Based Segmentation 



[0020] The third group of segmentation methods uses 

deformable templates. From A.M. Dale, B. Fischl, and M.I. 



- 6 - 

Sereno, "Cortical surface-based analysis I: Segmentation and 
surface reconstruction", Neurolmage 9: 179-194, 1999, a surface 
model in terms of a tessellated mesh is known. This model is 
fitted to the brain surface in Tl-weighted MR images in order to 
separate the brain from surrounding tissue. During the fitting 
process, two constraints are applied to the surface geometry: 
(i) A smoothness term (avoiding unnaturally folded surfaces), 
and (ii) an image term (keeping the surface close to the desired 
structure, here the brain surface) . 

[0021] During the fitting process, the surface is 

deformed by iteratively applying these constraints. Since the 
constraints take into account neighborhood information, and this 
information is iteratively propagated over the surface, 
deformable templates are suited to solve non-local segmentation 
problems. Deformable templates seem to be more robust and easier 
to automate than thresholding and morphology. 

[0022] The goal of Van der Geest et al (R.J. van der 

Geest, V.G.M. Buller, E. Jansen, H.J. Lamb, L.H.B. Baur, E.E. 
van der Wall, A. de Roos, and J.H.C. Reiber, "Comparison between 
manual and semi-automated analysis of left ventricular volume 
parameters from short-axis MR images", J Computer Assisted 
Tomography 21(5): 756-765, 1997) is to evaluate a semi-automated 
template based contour detection algorithm for the quantitative 
analysis of cardiovascular MRI . Left ventricular function 
parameters derived from automatically detected endocardial and 
epicedial contours were compared with results derived from 
manually traced contours in short-axis multislice GRE MRI 
studies of 10 normal volunteers and 10 infarct patients. 
Compared with manual image analysis, the semi-automated method 
resulted in smaller systematic and random differences for 
volume, ejection fraction, and wall thickness of the left 
cardiac ventricle. 
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[0023] A problem with template driven segmentation is 

caused by geometrical or anatomical variations. For pathological 
cases , e.g., templates often are not applicable. Interaction 
techniques integrated in template based segmentation algorithms 
can only serve to constrain parameters, but not to deal with 
variations beyond the model. 

Segmentation Based on the Watershed Transformation (WT) 

[0024] The aim of Sijbers et al. (1997) (J. Sijbers, P. 

Scheunders, M. Verhoye, A. Van der Linden, D. Van Dyck, and E. 
Raman, ''Watershed-based segmentation of 3D MR data for volume 
quantization", Magnetic Resonance Imaging 15(4), 1997) is the 
development of a semi-automatic segmentation technique for 
efficient and accurate volume quantization of Magnetic Resonance 

(MR) data. The proposed technique uses a variant of an immersion 
based watershed algorithm which is applied to the gradient 
magnitude of the MR data and which produces small volume 
primitives . 

[0025] The known drawback of the watershed algorithm, 

oversegmentation, is reduced by a priori application of a 3D 
adaptive anisotropic diffusion filter to the MR data. 
Furthermore, oversegmentation is a posteriori reduced by 
properly merging small volume primitives which have similar gray 
level distributions. The outcome of the preceding image 
processing steps is presented to the user for manual 
segmentation. Through selection of volume primitives, the user 
quickly segments the first slice which contains the object of 
interest. Afterwards, the subsequent slices are automatically 
segmented by extrapolation. 

[0026] U.S. Pat. No. 5,463,698 shows a fast implementation 

of the watershed transformation (WT) with markers based on 
hierarchically organized queues. The problem that is solved by 
the present invention, being the separation of structures that 
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exhibit similar gray scale characteristics, is not touched by 
this realization. 

[0027] Hahn et al (H.K. Hahn and H.-O. Peitgen, " x The 

Skull Stripping Problem In MRI Solved By A Single 3D Watershed 
Transformation", Medical Image Computing and Computer-Assisted 
Intervention (MICCAI 2000), Pittsburgh, Springer LNCS : 134-143, 
2000) , the disclosure of which being expressly incorporated 
herein by reference, have been the first to describe the direct 
application of the WT to a three-dimensional medical imaging 
problem. No gradient or edge information is derived prior to the 
trans formation . 

[0028] A priori, it is not clear that such a strategy 

could produce the desired results. However, there is a class of 
problems that can be solved by the direct application of the WT . 
In these cases the main problem consists of separating an object 
of interest from neighboring structures that exhibit a similar 
image intensity, but are separated by a thin layer of slightly 
darker or brighter tissue. In Hahn et al. (2000) the brain was 
separated in Tl-weighted MR images, much as in Dale et al. 
(1999), but with an increase in robustness. 

[0029] Watershed segmentation offers promising approaches 

to robust segmentation of images. The WT can be directly 
extended to 3D and even 4D and higher dimensional problems. 
However, the gradient information of an image, to which the WT 
is applied in accordance with the prior art, is only suited to 
yield appropriate results if the object borders exhibit image 
gradients that are higher than the gradients inside the object. 

Summary of the invention 

[0030] It is an object of the present invention to 

provide a novel method, computer program product and computer 
system for segmentation of a digital image. 



[0031] It is another object of the present invention to 

provide for a segmentation procedure that is robust even against 
considerable image non-uniformity as well as image noise. 

[0032] It is a further object of the present invention to 

provide a segmentation procedure that preserves the full 
original image information. 

[0033] It is a further object of the present invention to 

provide a segmentation procedure with improved user interaction 
techniques . 

[0034] It is a further object of the present invention to 

provide a segmentation procedure using no or only a small number 
of geometrical assumptions, such as anatomical a priori 
knowledge . 

[0035] It is a further object of the present invention to 

provide a segmentation procedure featuring fully automatic 
segmentation, especially for high-throughput applications. 

[0036] It is a further object of the invention to provide 

a segmentation procedure enabling interactive semi-automatic 
segmentation . 

[0037] It is a further object of the present invention to 

provide a segmentation procedure which is fast and with 
computational costs that are approximately proportional to the 
number of picture elements, especially in order to be able to 
deal with increasing image resolutions. 

[0038] It is a further object of the present invention to 

provide a segmentation procedure which enables an accurate 
measurement of the volume of a certain tissue type based on 
histogram analysis. 
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[0039] These and other objects of the invention are 

achieved by a method, computer program product and computer 
system which employ a watershed transformation method directly 
on the picture element data. This compares to the prior art 
where watershed transformation is used with respect to the 
gradients of the picture element data. As a result of the 
application of a watershed transformation directly on the 
picture element data one or more basins are identified in the 
digital image to be segmented. These basins do not necessarily 
correspond to the anatomical structures to be segmented. 

[0040] Therefore a post-processing of the watershed 

transformed digital image is performed in accordance with the 
invention. By means of the post-processing a basin of interest 
is reduced to the anatomical structure of interest to be 
segmented. 

[0041] In accordance with a preferred embodiment of the 

invention this is accomplished by applying a transfer function 
directly onto the picture element data corresponding to the 
basin of interest. 

[0042] In accordance with a further preferred embodiment 

of the invention a scalar value is determined for each picture 
element data by applying a transformation function. The 
transformation function takes only into account the picture 
element data to be transformed but not - for example- 
neighboring picture element data. In this preferred embodiment 
the watershed transformation is then applied onto the scalar 
values of the transformed picture element data. 

[0043] In accordance with a further preferred embodiment 

of the invention the digital image to be segmented is a medical 
image, such, as a computer tomographic (CT) , a magnetic 
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resonance (MR) a Tl-weighted MR image or any other kind of 
digital radiologic image. 

[0044] In accordance with a further preferred embodiment 

of the present invention the image data is three-dimensional or 
four-dimensional. In the latter case the fourth dimension is 
time . 

[0045] It is a particular advantage of this embodiment 

that this way it becomes possible to visualize movements of body 
organs, such as a pulsing heart, or the flow of body fluids over 
time . 

[0046] In accordance with a further preferred embodiment 

of the invention the post-processing is used to transform the 
transparency of picture elements in a region between a first 
tissue type of interest and a second tissue type in accordance 
with a transfer function in order to gradually increase the 
transparency of those picture elements. This way a structure of 
a second tissue type surrounding or enveloping a structure of 
interest of a first tissue type becomes transparent so that the 
structure of interest of the first tissue type is visualized. In 
accordance with a particular embodiment of the invention a four- 
dimensional color space having a transparency coordinate, such 
as the RGBA color coordinates, is used. 

[0047] In accordance with a further preferred embodiment 

of the invention the volume of an anatomical structure is 
measured by histogram analysis of the picture element data 
belonging to the segment containing the tissue type of interest. 

[0048] The present invention is particularly useful for 

applications such as: 

a. skull stripping, 

b. brain segmentation with estimation of brain volumes, 
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c. segmentation of the cardiac ventricles in cardiac 
images, 

d. cardiac segmentation with estimation of cardiac 
ventricle volumes , 

e. segmentation of the cerebral ventricles, i.e. the 
intra-cerebral CSF spaces, and to 

f . segmentation of the cerebral ventricles with accurate 
ventricular volumetry, 

g. bone segmentation in CT images, 

h. segmentation and accurate volumetry of lung nodules 
and similar tumors. 

[0049] An essential element of the invention is the 

watershed transformation (WT) , which is directly applied to the 
picture element data or the corresponding scalar values. This is 
in contrast to the prior art where the gradient information of 
an image is watershed transformed. 

[0050] Similar to thresholding and region growing 

techniques, the WT partitions an image into regions, so-called 
basins. However, there is not a specific gray value, e.g. a 
threshold, that determines the basin borders. Watersheds, being 
the borders of the basins resulting from the WT, are throughout 
constructed at connected local image maxima (so-called crest 
lines). Connected local image minima (so-called valleys), can be 
transformed into crest lines via gray level inversion. The WT is 
thus robust against image disturbance, such as low-frequency 
contrast non-uniformity as well as image noise, that does not 
change positions of local image extrema. 

[0051] Prior to image segmentation, it is a goal of image 

acquisition to produce a contrast between the structure of 
interest and adjacent structures. In presence of image non- 
uniformity, threshold based methods will fail. In case of steep 
image gradients within the structure of interest or of low 
gradients at their edges, gradient based methods will be likely 
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to give erroneous results. Both problems can be solved if the WT 
is directly applied to the original image data: Image non- 
uniformity usually consists of low frequency signals in the 
space domain and does not change the position of local image 
minima or maxima; and there are no restrictions regarding the 
image gradients when applying the WT, as long as the structure 
of interest is connected. 

[0052] In contrast to the WT applied to the gradient 

image, the present approach does not yield the object borders 
directly. Rather, the direct application of the WT is combined 
with a gray scale based post-processing. This can be a 
segmentation step to find object borders after the separation 
step; another possibility is the direct rendering (e.g., in 2D 
or 3D) of the result, e.g. using transfer functions; finally, a 
histogram analysis can be performed after the WT, e.g., in order 
to assess accurate object volumes taking into account partial 
volume effects, noise and image non-uniformity. 

Connectivity 

[0053] A basic assumption regarding the objects of 

interest is their connectivity. Two points of a gray level image 
are connected, if a path of adjacent voxels exists between them 
that is at least as bright as the darker one of the two points. 
When interpreting the image intensity as height information, the 
emerging landscape consists of several hills corresponding to 
bright image areas and valleys corresponding to dark image areas 
(Fig. 2 left) . According to the above definition of 
connectivity, two regions are disconnected, if they are sepa- 
rated by a valley, otherwise they are connected. 

[0054] In other words, applying the WT to the original 

data solves a difficult part in image segmentation, i.e. 
separating the structure of interest from neighboring structures 
that exhibit similar image intensities. The separation is 
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performed at the most likely line between the two structures. 
E.g., if two dark regions are grouped on two opposing sides of a 
slightly brighter line, this line is the place where the 
watershed will be constructed, when applying the WT to the 
original image data. The two regions will be attributed to two 
different basins. 

[0055] For the segmentation of three-dimensional objects, 

the three-dimensional WT is applied directly to the original 
image data. Three-dimensional WT refers to a generalization of 
the two-dimensional WT; a three-dimensional structuring element 
takes into account at least the six nearest neighbors of a voxel 
in order to assess its connectivity. Similarly, we propose to 
use a four-dimensional WT for the segmentation of a series of 
three-dimensional images, taking into account the eight nearest 
neighbors (two in each dimension, the fourth dimension being 
regarded as time) . This can be extended to arbitrary dimensions, 
by adding the two respective nearest neighbors to the 
connectivity rule. 

[0056] For multispectral data, where more than one scalar 

value is available for each spatial position, much as it is the 
case in color images (there usually three channels or, resp., 
spectra are used) , a scalar function may be derived by 
combination of these channels. This scalar function then serves 
as input data for the WT. 

[0057] Such derived images can also be computed from a 

single channel, e.g. from a single 3D image from a CT scanner. 
One example is the gradient function image. More basic examples 
are any kind of gray scale transformations (or transfer 
functions). These can, e.g., aim towards assigning the darkest 
gray values to the object of interest, such that surrounding 
tissues are brighter and the places where watersheds are 
preferably constructed. Finally, the input image can be 
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morphologically transformed, e.g. by erosion, dilation, closing, 
or opening in order to suppress thin structures. 

Controlling Oversegmentation - Preflooding and Markers 

[0058] The major problem of the WT is oversegmentation, 

i.e. the tendency to produce too many basins. The invention 
provides two methods to overcome this problem. First, 
preflooding can be applied to merge small basins with adjacent 
basins before constructing watersheds. Second, basins can be 
marked individually in order to determine the number of basins. 

[0059] The preflooding concept uses a measure of depth of 

a certain basin. As long as this depth is greater than a 
specific preflooding height, the basin will be separated from 
neighboring deeper basins via the construction of watersheds. If 
not, it will be merged with the neighboring basin of highest 
priority. Priority is, e.g., equivalent to absolute basin depth. 

[0060] In order to account for contrast variations at 

different image areas, the preflooding height can be a function 
of the local image characteristic, such as brightness or 
variance. This function may depend from one ore more parameters. 

[0061] In order to realize a system, where the parameters 

of the preflooding concept may be chosen interactively in real- 
time, a hierarchical WT is preferred. For each merging 
procedure, e.g., the depth of the merged basins can be measured 
relatively to the lowest point on the separating crest line (the 
merging point, where the watershed would be constructed) . These 
merging procedures can be stored in their order of occurrence or 
hierarchy in a list without merging any basin at this pre 
processing step. Afterwards, for an actual choice of parameters, 
processing this list (in combination with the segmented image 
containing elementary basins) suffices to perform the desired WT 
with preflooding. 



[0062] Another possibility to control oversegmentation is 

using markers. An arbitrary number of different markers can be 
assigned to arbitrary voxels prior to the transformation (or, in 
case of a hierarchical approach, prior to processing the list 
containing the merging information) . Then, two basins are only 
merged, if they are not marked differently. When merging a 
marked basin with an unmarked one, the latter will also receive 
the marker of the former. After the WT, all voxels will be 
assigned to one of the markers - i.e. the number of different 
regions is determined by the number of different markers. Point 
markers applied to single voxels are suitable for most cases. 

Automation 

[0063] The preflooding concept is dependent on one 

parameter (constant preflooding height) or a few parameters 
(preflooding height as function of local image characteristics) . 
If the problem is known, the proper (set of) parameter (s) can be 
determined automatically. The number of basins is monotonically 
decreasing with increasing preflooding height whereas its sizes 
are increasing. This provides the basis for robustness. The 
segmentation result does not change continuously with varying 
preflooding height but, depending on the image quality, a more 
or less broad range of proper values exists. The range of proper 
preflooding heights can be easily found as the first plateau 
(regarding the size of the largest emerging basin) larger than a 
certain threshold (the brain can be assumed to be larger than, 
e.g., 500 ml for adults). 

[0064] On the other hand, the basin of interest may be 

found automatically using some basin features regarding size, 
shape, position, mean and minimum image intensity, etc. For the 
skull stripping problem, e.g., the brain is often the largest 
existing basin, as soon as the proper preflooding height is 
found. 
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[0065] However, automation should not be the ultimate 

goal if large variations in the shape or anatomy are expected. 
An interactive segmentation, e.g. with the above mentioned 
marker concept, is often a better choice for pathological or 
abnormal cases . 

Volumetry 

[0066] An important application of the present invention 

is volumetry, i.e. the reliable estimation of the volume of 
structures of interest. The invention evaluates to separate 
objects from neighboring objects, including some of the 
intermediate tissue (that is brighter or darker than the two 
objects); most object borders are included in the segmentation 
results including the unsharp region or the partial volume 
region . 

[0067] Furthermore, for volumetry purposes it is 

desirable to include into the segmentation as much as possible 
of the adjacent structures showing a different gray scale, 
since, e.g., a histogram analysis that aims to account for 
partial volume effects profits from the knowledge of all tissue 
types involved. Typically, such histogram analysis is very 
robust and can be performed fully automatically. 

[0068] In general, after applying the WT, it is essential 

to preserve the original image information of the basin of 
interest . 

[0069] If parts of a connected structure should be 

quantified, cutting tools (like planes, discs, or hyper-planes 
in 4D) , are used to partition (separate) this structure. 



[0070] In order to reduce noise and artifacts, or to 

increase or decrease the amount of information in an image, 
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resampling and interpolation of the original image data can be 
applied prior to the WT . 

[0071] Dynamic volumetry, i.e. comparison of the volume 

of objects at different time points, can be performed by 
applying the results of the WT of one time point to all time 
points, whereas the gray scale based post-processing (in 
particular, histogram analysis) could be performed independently 
for each individual time point. 

[0072] The invention is particularly advantageous in that 

it enables to realize a computer system to perform segmentation 
which features: 

a. Robustness even against considerable image non- 
uniformity, image noise, other image disturbances, and 
low image resolution; 

b. High speed realizations with computational costs that 
are approximately of the order of the number of 
picture elements; 

c. Interactive segmentation with small interaction costs 
(few point markers on the whole 3D or 4D volume often 
suffice to separate the structure of interest from all 
other structures that exhibit the same gray scale 
characteristic) ; 

d. High quality and reproducibility of derived measures, 
such as ob j ect volumes ; 

e. No geometrical assumptions, respectively anatomical a 
priori knowledge, except the connectivity of 
structures of interest; 

f . Openness to the integration of geometrical models and 
anatomical assumptions; 

g. Fully automatic segmentation and interactive semi- 
aut oma t i c s egment a t i on ; 

h. Accurate volumetry, since object borders are included 
in the segmentation. 



[0073] It is to be noted that the application of the 

invention to a digital image provides one or more basins. One or 
more of such basins are further processed in a post-processing 
step. In some applications the basin or the basins are regarded 
as the segmentation result. This is the case for visualization 
where the post-processing is performed to transform the 
transparency of a region surrounding a region of interest within 
the basin. This is also the case for volumetry applications. 
However, in other applications of the invention the basin or 
basins which are initially identified are the basis for further 
segmentation. Further segmentation of a basin can be performed 
by means of any prior art gray-scale based segmentation method. 

Brief description of the drawings 

[0074] A more complete appreciation of the invention and 

many of the attendant advantages thereof will be readily 
understood from the following detailed description of preferred 
embodiments when considered in connection with the accompanying 
drawings , wherein : 

[0075] Figure 1: is illustrative of a preferred watershed 

transformation method in accordance with an embodiment of the 
invention, 

[0076] Figure 2: is a block diagram of an embodiment of a 

computer system for segmentation of a digital image, 

[0077] Figure 3: is illustrative of a basin-basin merging 

rule using preflooding, 



[0078] Figure 4: is a basin-basin merging rule being 

based on markers, 
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[0079] Figure 5: is a schematic example for an 

application of an embodiment of the present invention for 
segmentation and visualization by means of a transfer function, 

[0080] Figure 6: is a diagram being illustrative of the 

algorithmic behavior of the watershed transformation with 
increasing preflooding height relating the number of resulting 
catchment basins and the size of the largest basin to the 
relative preflooding height, 

[0081] Figure 7: is illustrative of an example for a 

model based histogram analysis for the purpose of volume 
measurement . 

Detailed Description of the Preferred Embodiments 

[0082] Figure 1 shows a preferred embodiment of a 

watershed transformation method in accordance with a first 
embodiment of the invention. 

[0083] In step 100 digital image data is loaded. The 

present invention is not restricted with respect to the kind of 
image data and is broadly applicable to a variety of classes of 
image data. The present invention is particularly useful for 
medical image data, in particular digital image data acquired 
from computer tomographic images, magnetic resonance images and 
Tl-weighted magnetic resonance images. Further it can be 
advantageously employed to other radiologic images. 

[0084] In step 102 a data entry is generated for each 

picture element of the digital image data. Typically digital 
image data comprises a large number of picture elements, i.e. 
pixels for two-dimensional images and voxels for three- 
dimensional images. In addition there are also four-dimensional 
images where the fourth dimension is time. Such four-dimensional 
images are acquired for observation of ongoing processes like 



- 21 - 

the flow of a fluid, the volume change of a structure of 
interest, or the pulsation of a human heart. 

[0085] In the latter case a time series of three- 

dimensional image data is acquired and stored in a four- 
dimensional array. In the following the term "picture element" 
will be used to designate a pixel or a voxel or a voxel at a 
specific time depending on the kind of digital image data. 

[0086] For each such picture element a data entry is 

generated comprising a pointer to a basin and a marker value. 
Both the pointer to a basin and the marker value are initialized 
to be equal to zero. This indicates that no pointer to a basin 
and no marker value are present in the data entry. It is to be 
noted that the "marker value" in the data entry is optional and 
is only used if a user can interactively mark individual picture 
elements or if automatic marking is used. 

[0087] In step 104 a scalar value is determined for each 

picture element. In one embodiment the scalar value is identical 
to a color coordinate or a gray value of the picture element. In 
this case the scalar value is already present in the picture 
element data itself. 

[0088] In another embodiment the scalar value is 

determined by evaluating all data belonging to a picture element 
by means of a transformation function. It is important to note 
that the scalar value is determined in this case only taking 
into consideration the picture element data of the picture 
element for which the scalar value is to be determined but not 
of other picture elements. 

[0089] For example, the scalar value can be calculated by 

transforming the color coordinates, the gray value and / or 
other pictorial information contained in the picture element. 
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[0090] In step 106 the picture elements are sorted. The 

sorting criteria is the scalar value determined in step 104. 
This results in a sequence of picture elements p ± . In the example 
considered here the scalar value is the gray value of a picture 
element and the picture elements are sorted in ascending order, 
i.e. the darkest picture element having the lowest gray value is 
the first picture element p 0 in the sequence and the brightest 
picture element having the highest gray value is the last 
picture element p n -i in the sequence. 

[0091] In step 108 the index i is initialized to be zero. 

This way, a loop is started for processing all picture elements 
in the sequence determined in step 10 6 starting with the first 
picture element p 0 in the sequence. 

[0092] In step 110 a set of picture elements N(p ± ) is 

determined. All picture elements pj contained in the set N(pi) 
are neighbors of the picture element p ± in the original digital 
image and the gray values of those neighboring picture elements 
Pj are lower than the gray value of the considered picture 
element pi. In other words the indices j of the picture elements 
Pj is lower than the index i of the considered picture element p it 

[0093] For the first picture element p 0 in the sequence 

determined in step 106 the set N(p ± ) is empty as - by definition 
- there are no picture elements pj for which j < i. However, for 
other picture elements p ± with i > 0 picture elements Pj in the 
neighborhood of pi which have a lower gray value than pi can be 
found . 

[0094] In step 112 it is determined whether the set N is 

empty or not. If the set N is empty the control goes over to 
step 114. In step 114 a new basin is created with the picture 
element p± as a seed point. In step 116 the index i is 
incremented and in step 118 it is determined if i equals n. 
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[0095] If this is not the case the control goes back to 

step 110 with the incremented value of the index i for 
processing of the consecutive picture element of the sequence 
determined in step 106. 

[0096] If i equals n the control goes to step 120 as all 

picture elements p ± of the sequence have been processed. In step 
12 0 a step of post-processing of the watershed transformed image 
is then performed. The kind of post-processing step or steps 
depends on the desired application such as visualization, 
segmentation or quantification, e.g. measurement of volumes. The 
post-processing performed in step 12 0 will be explained below in 
greater detail. 

[0097] In case the set N(pi) is not empty the control 

goes from 112 to 122. In step 122 a set of basins BA(pi) is 
determined. This set BA contains the basins B(pj). A basin B(pj) 
is a basin having a picture element pj of the set N. In other 
words the basins for all picture elements Pj contained in the set 
N are determined. Those basins B(pj) form the set BA(pi). 

[0098] In addition - if the marker concept is used - it 

is tested, if a basin B(pj) has a marker. If the basin B(pj) has 
no marker or the same marker as the considered picture element pi 
or if the considered picture element pi has no marker the 
corresponding basin B(pj) is defined to be a member of the set 
BA(pi) . If the contrary is the case the basin B(pj) does not form 
part of the set BA(pi) . 

[0099] In other words, step 122 serves to determine the 

set of basins containing at least one of the neighboring picture 
elements pj of pi as candidates for a merging operation. 



[0100] In step 124 it is determined whether the set 

BA(pi) is empty. If this is the case the control goes to step 114 



- 24 - 

to create a new basin with pi as seed point as there are no 
basins with which the picture element pi could be merged. 

[0101] If the contrary is the case step 126 is carried 

out to determine the basin B* of the set BA(pi) having the 
highest priority. In this example, priority is defined such that 
the basin B* having the highest priority is the basin of the set 
BA(pi) having the darkest seed point, i.e. the seed point with 
the lowest gray value. 

[0102] In step 128 the picture element p± is merged with 

the highest priority basin B*. This way the picture element pi is 
assigned to the basin B* and forms part of this basin B* from now 
on . 

[0103] In step 130 one or more basin-basin merging rules 

are applied with respect to pairs of basins formed of basins B k 
contained in the set BA(p±) and the highest priority basin B*. 
Those pairs of basins B k and B* to which one of the basin-basin 
merging rules applies are merged, i.e. B k is merged with B . In 
case that more than one basin-basin merging rule is used, in a 
preferred embodiment of the invention, all basin-basin merging 
rules have to apply with respect to a pair of such basins as a 
precondition for merging of the basins. Examples of basin-basin 
merging rules will be explained in greater detail with respect 
to Figures 3 and 4 . 

[0104] The index i is incremented in step 116. 

[0105] After the index i has reached the value n the 

whole digital image data has been processed and the watershed 
transformation is complete resulting in one or more basins 
identified in the original image. 



[0106] It is to be noted that the above described 

watershed transformation method in accordance with the invention 
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is particularly advantageous in that it is robust for a large 
class of different digital image data and also prevents 
oversegmentation. However, other watershed transformation 
methods can also be used including watershed transformation 
methods known from the prior art. 

[0107] Figure 2 shows a block diagram of an embodiment of 

a computer system of the invention. The computer system 2 00 has 
a memory 201 for storage of digital image data. The memory 201 
is organized as an n-dimensional storage array. 

[0108] In the example considered in Figure 2 the storage 

array of the memory 201 is two-dimensional; for other 
applications the array can have three, four or more dimensions. 
To perform an access operation to the memory 2 01, the memory 201 
has a pointer 202 pointing to a memory element 2 03 providing an 
entrance point into the storage array. 

[0109] When a digital image is loaded into the computer 

system 200 (cf . step 100 of Figure 1) the individual picture 
elements of the digital image are stored in corresponding memory 
elements of the memory 201. Preferably the topological ordering 
of the picture elements is preserved when the digital image data 
is loaded into the memory 201. In other words, if a picture 
element p± is stored in memory element 204 the neighboring 
picture elements of this picture element pi are stored on 
neighboring memory elements 205 to 212 while preserving the 
original topology. 

[0110] Further the computer system has a memory 213 for 

the storage of a list of addresses of memory elements within the 
storage array of the memory 201. The list of addresses 
corresponds to the sequence of picture elements p 0 , Pi, - Pi 
p n _i as explained above with respect to step 106 (cf . Figure 1) . 
In the example considered here the top most element of the list 
stored in the memory 213 identifies the first picture element p 0 
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in the sequence by means of the storage address of its 
corresponding memory element within the memory 201. Likewise the 
following addresses identify consecutive picture elements pi, p 2 ... 
of that sequence. 

[0111] In case the scalar value of each picture element 

(cf. step 104 of Figure 1) is not contained in the picture 
element data itself an additional entry is required in the list 
stored in the memory 213 for the scalar value. This way the 
scalar value of each picture element within the sorted sequence 
can be directly accessed. 

[0112] Further the computer system 200 has a memory 214 

for storage of a basin list. The basin list of the memory 214 
has four entries per basin. The first entry is the basin number, 
the second entry is the scalar value of the seed point of this 
basin, the third value is the marker - if the marker concept is 
used - and the fourth entry is a link. 

[0113] Initially the list stored in the memory 214 is 

empty. When the first basin is created in step 114 a 
corresponding entry is made into the list of memory 214. For 
example, the first basin number is zero. 

[0114] The scalar value of the seed point p± is entered 

as well as the marker value of the seed point p± - if any. 
Further a link is entered by entering the basin number "zero", 
such that the basin with the basin number zero points to itself. 
For merging this basin with another basin the link is changed to 
point to the basin number of the basin with which this basin is 
to be merged. 

[0115] When the next basin is created in a subsequent 

step 114 (cf. Figure 1) a consecutive entry in the list stored 
in memory 214 is created with a consecutive basin number. 
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[0116] The computer system 200 further contains a memory 

215 for storing a list of entries for each picture element. This 
corresponds to step 102 of Figure 1. Each entry in this list 
contains an indication of the basin number to which a picture 
element pi is assigned and its marker value - if any. 

[0117] Instead of using two entries per picture element 

it is also possible to use only one entry per picture element in 
the list stored in the memory 215 if ranges are defined for 
basin numbers and for marker values. For example, if a first 
range has been defined for basin numbers and a second range has 
been defined for markers still the complete information is 
available : 

[0118] If the number stored in the list is within the 

first range, this means that the picture element p± has already 
been assigned to a basin. In this case no marker value needs to 
be given within the list stored in memory 215 as the marker 
value for this picture element p ± is the same as the marker value 
stored in the marker entry of the list of memory 214 for the 
basin to which the picture element p± is assigned. 

[0119] If the value stored in the list belongs to the 

second range, this implies that the corresponding picture 
element pi has not yet being assigned to a basin. When the 
corresponding picture element pi is assigned to an existing basin 
(cf . step 128 of Figure 1) or if a new basin is created with the 
picture element p x as seed point (cf . step 114 in Figure 1) the 
marker value of the picture element pi is stored in the basin 
list of memory 214 so that the marker information in the list of 
memory 215 becomes redundant. This makes it possible to replace 
the marker information in the list stored in memory 215 by the 
basin number to which the picture element p± is assigned. 



[0120] Further the computer system 200 has a computer 

program 216 which comprises a variety of program components for 
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the performance of the watershed transformation method and post- 
processing of the transformed image data. Further the computer 
system 200 has a monitor 217 for the display of the original 
image data and / or the results of the watershed transformation 
and post-processing operations for visualization, segmentation, 
or quantification . 

[0121] In operation, the digital image data of the image 

to be processed is loaded into the memory 201. The list stored 
in the memory 215 of entries for the picture elements is empty 
or contains a number of markers - if the marker concept is used. 
A user can interactively mark certain picture elements of the 
digital image data by clicking on the displayed picture elements 
on the monitor 217. Alternatively or in addition, automatic 
marking can be employed. 

[0122] The lists stored in the memories 213 and 214 

initially are empty. 

[0123] First the picture elements stored in the memory 

201 are sorted in ascending order whereby the scalar value of 
the picture elements is the basis for the sorting operation. 
This way the list stored in the memory 213 is created by a 
corresponding component of the program 216. 

[0124] The sequence of picture elements identified by the 

list stored in memory 213 is processed starting with picture 
element p 0 - This is done by another component of the program 216. 

[0125] By processing the picture elements p± a list of 

basins is created in the memory 214 and merging operations 
between picture elements and basins can take place as reflected 
by the corresponding list stored in memories 215 and the link 
entry of the list stored in memory 214. 
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[0126] For example, if the basin with the basin number r 

is to be merged with the basin having the basin number s, this 
is done by setting the link entry of the basin with the number r 
to s. In order to determine the basin to which a given picture 
element p± belongs it is therefore necessary to use the basin 
number stored in the list of memory 215 for that picture element 
pi as an entry point into the list stored in the memory 214. 

[0127] If the entry for this basin number in the list of 

memory 214 has a link pointing to the same basin number no 
further steps need to be performed. If, however, the link points 
to another basin number, this other basin number needs to be 
checked for another link and so forth until an entry in the list 
of the memory 214 has been identified with a link pointing to 
itself. This entry identifies the basin to which the picture 
element pi actually belongs. 

[0128] Alternatively it is also possible to update the 

list stored in the memory 215, i.e. the basin numbers assigned 
to the picture elements, each time a basin-basin merging takes 
place. In this case the links of the list stored in the memory 
214 become redundant. Depending on the implementation this 
requires more processing time in comparison to the first 
alternative using the links. 

[0129] As a result of the watershed transformation of the 

image data loaded into the memory 2 01 one or more basins are 
identified automatically. This way each memory element of the 
memory 201 is assigned to such a basin. When the next image is 
loaded into the memory 2 01 the position of the basin or the 
basins can be maintained such that no watershed transformation 
has to be performed again. Rather post-processing of the basin 
or the basins can immediately start. This is particularly 
advantageous for volumetry and quantitative comparison of 
changing volumes over time within the same basin. 
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[0130] Figure 3 shows an example for a basin-basin 

merging rule (cf. step 130 of Figure 1) using pref looding . 

[0131] For example, for application of the basin-basin 

merging rule depicted in Figure 3, the control goes from step 
130 of Figure 1 to step 300 of Figure 3. In step 300 the 
difference A between the scalar value of the picture element pi 
and the scalar value of the seed pixel of the considered basin B k 
is determined. In step 302 it is determined whether the 
difference A is lesser or equal than the pref looding height- The 
preflooding height is a parameter value which can be selected by 
the user or which can be set automatically. It is advantageous 
to set the preflooding height while taking into consideration 
the nature of the digital image data to be processed. 

[0132] If the difference A is in fact lesser or equal to 

the preflooding height the basins B k and the highest priority 
basin B* are merged in step 304. In the computer implementation 
of Figure 2 this means that the link entry in the list of memory 
214 for the basin B k with the basin number k is set to the basin 
number of the highest priority basin B*. This way it is not 
necessary to update the list stored in memory 215 as the correct 
assignment of each picture element pi is properly identified by 
the link entries. 

[0133] After step 304 step 306 is performed to increment 

k in order to test the preflooding basin-basin merging rule with 
respect to another pair of basins B k and B* . 

[0134] If the difference A is greater than the 

preflooding height no merging of basins takes place and the 
control goes from step 302 directly to step 306. 

[0135] Figure 4 shows a marker based basin-basin merging 

rule which can be employed alternatively or in addition to the 
preflooding basin-basin merging rule of Figure 3. 
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[0136] For example, the control can go directly form step 

130 of Figure 1 to step 400 of Figure 4 or alternatively the 
control can go to step 400 after the preflooding basin-basin 
merging rule of Figure 3 has been tested for all relevant pairs 
of basins. 

[0137] In step 400 the markers of a pair of basins B k and 

the highest priority B* are determined, if any. In the computer 
implementation of Figure 2 this can be done by reading the 
corresponding marker values from the list stored in the memory 
214. 

[0138] In step 402 it is determined whether both basins 

B k and B* have markers and if these markers are different. If 
this is not the case the control goes to step 404 in order to 
merge the basins B k and B*. Again this merging operation can be 
performed by setting the link entry of the basin B k in the list 
stored in memory 214 (cf . Figure 2) to the basin number of the 
highest priority basing B* . 

[0139] Step 406 is performed in order to increment the 

index k such that the marker based basin-basin merging rule is 
employed with respect to a consecutive pair of basins B k and B . 

[0140] In case it is determined in step 402 that both 

basins B k and B* have markers and that these markers are 
different, no basin-basin merging is performed an the control 
goes directly to step 406. 

[0141] Figure 5 shows a schematic example of an 

application of the present invention for the purposes of 
segmentation and visualization. 



[0142] Figure 1 schematically shows a medical image 500 

such as a MR image of the skull. The medical image 500 has a 
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region 501 which shows the brain and a region 502 which 
surrounds the region 501. The region 502 contains the cranial 
bone . 

[0143] The diagram 504 shows the profile of gray values g 

within the medical image 500 along the x-axis . In this example, 
the x-axis traverses the center of the object. With respect to 
the skull depicted in medical image 500 it is known that the 
gray values within the region 501 vary between a and b whereas 
the gray values within the region 502 vary between c and d. 
Further there are partial volume effects, i.e. picture elements 
which partly cover tissue types of the regions 501 and 502 and 
which have therefore gray values g between b and c. 

[0144] In the example considered here the watershed 

transformation method of Figure 1 is employed to the medical 
image 500. This produces a basin which is circumscribed by the 
crest line 503. This basin contains the entire region 501 and 
also a part of the region 502. The crest line is formed by 
connected local maxima of the gray values g of the medical image 
500. 

[0145] For visualization of the region 501 it is 

desirable to remove the parts of the region 502 within the basin 
as circumscribed by the crest line 503. This is especially 
important for three-dimensional or four-dimensional display as 
in those cases the outer parts of the region 502 within the 
basin would cover the region 501 which would make the region 501 
invisible on the monitor. 

[0146] To remove those parts of the region 502 a transfer 

function is utilized. The transfer function 505 serves to 
transform the opacity of picture elements having gray values g 
below b. Opacity is the inverse of transparency. An opacity of 
zero corresponds to total transparency which makes the 
corresponding picture elements invisible on the monitor. 
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[0147] Those picture elements having gray values below c 

are made completely transparent in order to make them invisible. 
Picture elements having gray values between c and b are 
transformed by applying a linear function to gradually increase 
the opacity from zero. Picture elements having a gray value of b 
and greater are not modified and the opacities of those picture 
elements remain unaltered. In other words, the opacity t(b) of a 
picture element having a gray value of b remains unchanged as 
well as the opacities of picture elements having gray values 
bigger than b. 

[0148] By applying the transfer function 505 as depicted 

in Figure 5 picture elements of the basin having gray values 
between d and c are made completely transparent and thus 
completely invisible on the monitor whereas the transparency of 
the picture elements is gradually increased between picture 
elements having gray values ranging from c to b. 

[0149] With respect to the diagram 504 this has the 

following effect: The basin which is limited by the crest line 
503 extends between xi and x 4 along the x-axis. Those picture 
elements of the basin which are located in the border region 
between xi and x 2 and between x 3 and x 4 are transformed by means 
of the transfer function 505 in order to make the region 501 
visible for three- or four-dimensional display. 

[0150] In order to perform this transformation it is 

advantageous to use the RGBA color coordinate system as the A- 
coordinate directly determines the opacity t of a picture 
element . 

[0151] Figure 6 shows the algorithmic behavior of the 

watershed transformation method of Figure 1 with increasing 
preflooding heights. The number of resulting catchment basins is 
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depicted by the solid line and the size of the largest basin by 
the dashed line. 

[0152] The modified watershed transformation (WT) 

included in the present invention is a powerful tool for 
segmenting the whole brain from MR1 datasets. The application to 
the original MRI data rather than the gradient image is an 
approach which works excellent for brain segmentation with 
images showing an intensity characteristic comparable to the MR 
Tl-signal. In particular, it is important that gray matter (GM) 
is at least as bright as cerebro spinal fluid (CSF) but not 
brighter than white matter (WM) . 

[0153] Assuming that the brain is surrounded by CSF and 

all non-brain tissue that needs to be removed shows brighter 
image intensities than CSF, skull stripping becomes equivalent 
to isolating a single hill in the four-dimensional landscape. 
This hill then represents the whole brain including the 
cerebellum and the spinal cord, as long as they are connected 
within the image. The valley corresponding to CSF and other low 
intensity tissue such as bone and meninges will then define the 
accurate border of the segmentation result. 

[0154] After the transformation with an appropriate 

preflooding height one basin should exist that represents the 
whole brain with all parts that are connected via WM, i.e. 
cerebellum and spinal cord. This basin usually is the largest 
existing one, unless the field of view (FOV) has been selected 
too large. In that case, one could choose the basin containing 
the center of FOV or a manually selected basin. 

[0155] Here the significance of the intensity 

characteristics of Tl-weighting for this approach becomes 
clearly visible: The connected WM is surrounded by darker GM and 
even darker CSF, and can thus be regarded as the top of a hill. 
WM regions may not be connected in 2D, but must be connected in 
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3D, since the WM is interconnecting all functional parts of the 
brain. Therefore, a segmentation procedure is preferred that 
works fully in 3D, rather than 2D techniques that may not 
account for three-dimensional connectivity (even if they are 
combined with extrapolation or interpolation techniques) . 

[0156] Figure 7 shows a model based histogram analysis 

for automatic quantification, such as of CSF volumes. The model 
comprises three independent Gaussian functions and partial 
volume functions as will be explained in more detail in the 
following . 

[0157] The histogram depicted in Figure 7 relates to a 

basin as identified as a result of the watershed transformation 
method, e.g. by employing the method as explained above with 
respect to Figure 1. With respect to the example considered in 
Figure 5 this means that the basin contains the region 501 as 
well as parts of the region 502 surrounding the region 501. 

[0158] In the case of a brain those regions comprise a 

variety of tissue types: cerebro spinal fluid (CSF), white 
matter (WM) , gray matter (GM) , and possibly veins and bone. To 
model each of these tissue types a function <p is used. The 
corresponding Gaussian functions for CSF, GM and WM are depicted 
in the histogram of Figure 7 . 

[0159] In addition a partial volume function p a p is used 

to model the partial volume effects of picture elements covering 
two different kinds of tissue types. The dashed lines in the 
histogram of Figure 7 depict two of such partial volume 
functions by way of example. 

[0160] Based on the Gaussian functions for the tissue 

types and the partial volume functions a total fit function f(g) 
results which is fitted to the actual values of the histogram 
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h(g) in order to determine the parameters c ai s a , A a , Aap of the 
model. The corresponding formulas are given below: 



(x) = y dx' (2) 



fttffo) = 7~Z~ r { } 

fti) = S A « • + S ^ • v«M ( 4 ) 



E(c a ,s aj A aj A^; h(g)) = /j-^ XI [/(<?) ~M<?)]' 

V 11 



(5) 



a, jS: tissue types, one of white matter (WM) , gray matter 

(GM) , cerebro spinal fluid {CSF), bone, etc.; condition for (J: c$ 
> c a . 

c a : mean gray value for tissue type a. 

s a : width of distribution for tissue type a. 

A a : number of pixels for tissue type a. 

A a p: number of pixels for partial volume distribution of tissue 
types a and /3. 

<p a (x) : normal distribution of tissue class a. 

$ a (x) : error function (i.e. integral over <p a {x)) used as 

auxiliary function for partial volume approximation. 

p a p(g) : mixed Gaussian used as approximation of partial volume 

distribution between tissue types a and /3. 

f (g) : total fit function. 

h (g) : histogram. 

E: root mean square deviation of total fit function and 
histogram that is to be minimized. 



[0161] Fitting the model function to the discrete histogram 
function is performed by minimizing mean square deviations, e.g. 
via a robust simplex algorithm or a gradient-descent algorithm, 
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such as the Marquart-Levenberg algorithm. Good initial values 
are derived from the two most prominent histogram peaks . Once 
the fit converges, quantifying the corresponding CSF volume is 
straightforward. A cutoff value is computed such that 50 % of 
the partial volume voxels are added to the result. For each 
individual measure, the expected volumetric uncertainty is 
calculated based on the image quality, i.e. image noise and non- 
uniformity, as reflected in the histogram. 

[0162] In the example considered here three different 

partial volume functions are used for the partial volumes of 
CSF, GM; CSF, WM; and GM, WM. The latter is not depicted in 
Figure 7 . 

[0163] As the Gaussian distribution functions <p a {x) and 

<Mx) are normalized the volume of CSF within the considered 
basin is calculated as A C sf + 1/2 Acsf, gm + 1/2 A C sf, m* 

[0164] In addition, the remaining differences between the 

total fit function and the values of the histogram can be taken 
into account for exact determination of the volume. 

[0165] A listing of the reference numerals found in the 

drawings and throughout the specification is now provided. 
While various descriptions of the present invention are 
described above, it should be understood that the various 
features can be used singly or in any combination thereof. 
Therefore, this invention is not to be limited to only the 
specifically preferred embodiments depicted herein. 

[0166] Further, it should be understood that variations 

and modifications within the spirit and scope of the invention 
may occur to those skilled in the art to which the invention 
pertains. Accordingly, all expedient modifications readily 
attainable by one versed in the art from the disclosure set 
forth herein that are within the scope and spirit of the present 
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invention are to be included as further embodiments of the 
present invention. The scope of the present invention is 
accordingly defined as set forth in the appended claims. 
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Claims 

1. A method for segmentation of a digital image being 
represented by picture element data, the method comprising 
the steps of: 

- performing a watershed transformation of the digital 
image by applying a watershed transformation method on 
the picture element data to provide one ore more 
basins , 

- post-processing of at least one of the basins by 
processing the picture element data belonging to the 
at least one of the basins, where the post-processing 
is selected from the group of post-processing steps 
consisting of: 

i. gray-scale based segmentation, 
ii. visualization using a transfer function, 
iii. volumetry using histogram analysis. 

2. The method of claim 1 further comprising: 

- determining a scalar value for each picture element by 
applying a transformation onto the corresponding 
picture element data, 

- applying the watershed transformation on the scalar 
values to provide the one or more basins. 

3. The method of claim 1 whereby the digital image is a 
medical image, comprising one or more anatomical structures 
to be segmented. 



